Persistent Systems | Data Engineer Interview Experience | 4+ YoE



Round 1: Technical

✅ Tell me about the most recent project you were part of.

✅ Suppose you need to load data from Azure Blob Storage to an SQL database. Explain step-by-step how you would create the pipeline in Azure Data Factory (ADF).

✅ How can you optimize pipelines that are time-consuming?

✅ What are the different types of triggers available in ADF?

✅ 2-3 questions related to Azure Functions.

✅ What is the role of Apache Spark in Databricks?

✅ Write Spark code to read and write data from/to a Parquet file.

✅ Questions related to data governance in Databricks.

✅ 3-4 scenario-based questions on functions and stored procedures in SQL.

✅ Questions related to views and materialized views in SQL.

✅ Questions related to indexing in SQL.

✅ 4-5 questions related to Power BI.

✅ Can CTEs (Common Table Expressions) be used anywhere in an SQL script?
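
For the CTE question, a short answer is that a CTE is visible only to the single statement that directly follows its WITH clause, not to the rest of the script. Below is a minimal sketch of that scoping rule. It uses Python's built-in sqlite3 module as a stand-in for the interview's SQL database, and the emp table, the high_paid CTE name, and the sample rows are made up for illustration.

```python
import sqlite3

# In-memory database with a small, made-up employee table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (emp_id INTEGER, name TEXT, salary REAL)")
conn.executemany(
    "INSERT INTO emp VALUES (?, ?, ?)",
    [(1, "Asha", 50000.0), (2, "Ravi", 70000.0), (3, "Meera", 90000.0)],
)

# The CTE is defined and consumed inside one statement, so this works.
in_scope = conn.execute(
    """
    WITH high_paid AS (
        SELECT emp_id, name FROM emp WHERE salary > 60000
    )
    SELECT name FROM high_paid ORDER BY emp_id
    """
).fetchall()

# A later statement cannot see the CTE: its name is gone once the
# statement that defined it has finished.
try:
    conn.execute("SELECT name FROM high_paid")
    cte_visible_later = True
except sqlite3.OperationalError:
    cte_visible_later = False

print(in_scope)
print(cte_visible_later)
```

If the same intermediate result is needed by several statements, a view or a temporary table is the usual alternative.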

Round 2: Techno Managerial

✅ Tell me about the most recent project you were part of, including your roles and responsibilities.

✅ Questions related to normalization, primary key, unique key, and composite keys.

✅ What is indexing, and how is indexing used in SQL?

✅ How can you fine-tune an SQL query for better performance?

✅ Write an SQL query for a given scenario.

✅ Write PySpark code to process the following tables:

Emp table: Empid, name, salary, location

Table 2: Empid, emp_designation

Output: Empid, emp_name, emp_designation

Round 3: HR

✅ Tell me about yourself.

✅ Why did you leave your previous organization?

✅ Why do you want to join this organization?

✅ How soon can you join, and what is your preferred location?